SUMMARY


In  this  report  we  present  a  brief  description  of  the  research  carried 
out  by  faculty,  staff,  and  students  in  the  M.I.T.  Laboratory  for  Information 
and Decision Systems under Grant AFOSR-82-0258. The principal investigator
for this research is Professor Alan S. Willsky, and the co-principal
investigator  is  Professor  George  C.  Verghese.  The  time  period  covered  in 
this  status  report  is  from  July  1,  1983  through  June  30,  1984. 

The  basic  scope  of  this  grant  is  to  carry  out  fundamental  research  in 
the  analysis,  control,  and  estimation  of  complex  systems  with  particular 
emphasis  on  the  use  of  methods  of  asymptotic  analysis  and  multiple  time 
scales  to  decompose  complex  problems  into  interconnections  of  simpler  ones. 
During  the  time  period  covered  by  this  report  significant  progress  has 
been  made  in  several  areas,  leading  to  important  results  and  to  promising 
directions  for  further  research. 

The  specific  topics  covered  in  this  report  are: 

I.  Analysis  of  Systems  Possessing  Multiple  Time  Scales 

II.  Control  and  Estimation 

In  January  1984  the  renewal  proposal  was  written  for  this  Grant  for  the  time 
period  July  1,  1984  through  June  30,  1985.  As  part  of  this  proposal  we  included 
a  detailed  progress  report  for  the  time  period  January  1,  1983  through 
December 31, 1983. Since this earlier report covers one half of the time period
for  the  present  report,  we  have  included  it  as  an  Appendix  to  this  report. 

We refer heavily to the Appendix and confine ourselves to a brief updating of
our research progress.


I.  Analysis  of  Systems  Possessing  Multiple  Time  Scales 

In the Appendix we describe some of the results of our research in
constructing multiple time scale decompositions of linear systems of the form

$$\dot{x}(t) = A(\epsilon)\,x(t) \tag{1.1}$$

based on performing the Smith decomposition of the matrix $A(\epsilon)$. In the Appendix
we identified numerous research problems that were opened up by our results
through the end of 1983. In the first half of 1984 we have made significant
progress on several of these:

• We have developed a numerical method for computing the Smith
decomposition of $A(\epsilon)$ and the required Schur complements to obtain
the time scale decomposition of (1.1). Further numerical testing
and modifications are no doubt necessary, but this work provides
us with a very useful research tool. This tool has already proved
useful in our initial testing of several approaches for practical
time scale decomposition of systems without an explicitly identified
small parameter $\epsilon$ (see Problems (2) and (5) of Section I
of the Appendix).

• It is in Problems (6) and in particular (7) of Section I of the
Appendix that we have made the most significant progress. In
particular we have made precise the relationship between the
multiple semisimple nullstructure (MSSNS) condition and the
orders of the eigenvalues and invariant factors of $A(\epsilon)$. In
addition, when the MSSNS condition is violated we posed in (7)
the problem of determining a scaling transformation so that the
transformed version of $A(\epsilon)$ does have MSSNS. At present we have
a very strong conjecture for this problem which we have verified
for low- (up to 4-) dimensional problems. The detailed, complete
proof of this result should be available soon, and in the process
we expect to make explicit contact with the Jordan form and
invariant subspace structure of $A(\epsilon)$. As indicated in the Appendix
such a scaling result will also allow us to analyze problems of
singular or cheap control and estimation.


• We have also made significant progress along a somewhat different
line for obtaining the time scale decomposition of (1.1) in the
case when (1.1) represents the evolution of probabilities for a
finite-state Markov process (FSMP). In our earlier work we had
applied our previously developed methods to the case of FSMP's, but
the results were not completely satisfactory since the required
computations were quite complex. We have at present developed
an alternative procedure which is far simpler computationally.
The procedure requires that transient states at any particular
time scale have a particular, restrictive transition structure,
but we have also developed a method for modifying any FSMP so
it possesses this property. We have not yet completed the proof
of the validity of this method, but this should be accomplished
shortly. In the process we expect to shed significant light
both on Problem (3) of Section I of the Appendix, which deals with
"allowable modifications" of $A(\epsilon)$, i.e. perturbations that do
not change the time scale structure, and on the detailed structure
of singularly perturbed FSMP's.


II.  Control  and  Estimation 

In this area we have made progress in our investigation of Problem (3)
of Section II of the Appendix. Specifically we have obtained results on the
modification of the open loop time scales of

$$\dot{x}(t) = A(\epsilon)\,x(t) + B(\epsilon)\,u(t) \tag{2.1}$$

using state feedback

$$u(t) = K(\epsilon)\,x(t) \tag{2.2}$$

where $K(\epsilon)$ may have terms of the form $\epsilon^{-1}$, $\epsilon^{-2}$, etc. In addition, as pointed
out in the preceding section, our work on scaling transformations is of
direct applicability to the problems of singular and cheap control described
in Problem (5) of Section II of the Appendix.

Most of our effort in estimation and control in the last 6 months,
however, has been in our research on estimation for a class of complex FSMP's
whose structure and qualitative properties have been motivated by the problems
of automated analysis of electrocardiograms (ECG's). In the Appendix we
describe the general nature of this class of FSMP's, which typically consist of
interconnections of several subprocesses which can influence the transition
behavior of each other. We also discuss the interpretation of the observed
signals as encodings of the discrete sequence of FSMP states, and we describe
our objective of developing and examining estimator structures consisting of
compatibly interconnected sets of decoders, each charged with estimating the
state trajectory sequence of one of the subprocesses. A challenging problem
here is the determination of how to connect these local decoders, i.e.
deducing precisely what information about one subprocess should be transmitted
to the estimator of another subprocess. It is clear that some type of
information should be transmitted to a particular subprocess estimator from
estimators for other subprocesses which either affect or are affected by
the particular subprocess; however, the identification of precisely what
information is useful is a challenging problem. In addition, in order to
make use of such information each subprocess estimator must have an aggregate
model of how its subprocess interacts with other subprocesses. Again the
issue of how to perform such aggregate modeling is a challenging problem.

At present Mr. P. Doerschuk is completing his Ph.D. thesis in this
problem area (see ref. [10] of the Appendix), and in his research Mr. Doerschuk
has shed significant light on the issues just described as well as the
issue of how one measures performance of such an estimator (see Section II of
the Appendix for a discussion of this problem). Mr. Doerschuk's research in
the first half of this year has focused on a series of increasingly complex
process models consisting of two interacting subprocesses. These models
represent simplifications of the dynamic behavior of a normal heart and a heart
subject to spurious ventricular contractions using Mr. Doerschuk's previously
developed framework for ECG modeling. In addition to the present empirical
studies of Mr. Doerschuk we are also in the process of developing computational
methods for bounding and approximating probabilities of error and related
performance measures for complex decision problems such as those that arise
SUMMARY

In  this  report  we  present  a  brief  description  of  the  research  carried 
out  by  faculty,  staff,  and  students  in  the  M.I.T.  Laboratory  for  Information 
and Decision Systems under Grant AFOSR-82-0258. The principal investigator
for  this  research  is  Professor  Alan  S.  Willsky,  and  Prof.  George  C.  Verghese 
serves  as  a  senior  researcher  on  the  project.  The  time  period  covered  in 
this  status  report  is  from  January  1,  1983  to  December  31,  1983. 

The  basic  scope  of  this  grant  is  to  carry  out  fundamental  research  in 
the  analysis,  control,  and  estimation  of  complex  systems,  with  particular 
emphasis  on  the  use  of  methods  of  asymptotic  analysis  and  multiple  time 
scales to decompose complex problems into interconnections of simpler ones.
During  the  time  period  covered  by  this  report,  significant  progress  has 
been  made  in  several  areas,  leading  to  important  results  and  to  promising 
directions  for  further  research. 

The  specific  topics  covered  in  this  report  are: 

I. Analysis of Systems Possessing Multiple Time Scales

II. Control and Estimation

A  list  of  publications  supported  by  this  grant  is  also  included.  We  refer 
heavily  to  these  papers  and  reports  for  detailed  technical  developments. 


I.  Analysis  of  Systems  Possessing  Multiple  Time  Scales 

During the past year we have made significant progress in this portion
of our research. Our recent work in this area is described in more detail
in [7], [9], [11]. This work has focused on the examination of systems of
the form

$$\dot{x}(t) = A(\epsilon)\,x(t) \tag{1.1}$$

where $\epsilon$ is a small parameter. In our earlier work [1], [4] we had developed
a method for determining when it is possible to construct, and for constructing,
an approximation to the solution of (1.1) of the form

$$x(t) = e^{A_0 t}\, e^{A_1 \epsilon t} \cdots e^{A_r \epsilon^r t}\, x(0) + O(\epsilon) \tag{1.2}$$

where $A_i A_j = 0$ if $i \neq j$ and (1.2) is uniformly valid on $0 \le t < \infty$. Thus
(1.2) says that one can construct an $\epsilon$-independent similarity transformation
$\xi = Tx$ which puts the system (1.1) into a block-decoupled form which
explicitly decomposes the system into subsystems evolving at different
time scales, i.e.

$$\xi(t) = \mathrm{diag}\left(e^{F_0 t},\, e^{F_1 \epsilon t},\, \ldots,\, e^{F_r \epsilon^r t}\right)\xi(0) + O(\epsilon) \tag{1.3}$$

The  motivation  for  this  line  of  research  came  from  a  desire  to  extend 
the  results  and  approaches  developed  by  others,  including  Kokotovic, 
Campbell, O'Malley, Chow, Hoppensteadt, Khalil and Sannuti, who have
demonstrated  the  utility  of  time-scale  decompositions  for  system  analysis, 
approximation,  control,  and  estimation.  The  starting  point  for  much  of 
this  work  is  a  model  of  the  form  of  (1.1)  but  in  which  the  time-scale 
structure  is  explicitly  displayed.  For  example,  consider  the  two-time- 
scale  model  studied  extensively  by  Kokotovic  and  co-workers: 


(The form actually used in many of these other works can be brought to (1.4)
by a simple change of time scale.) For this model there is an extremely
straightforward procedure for determining if a two-time-scale decomposition
exists and, if so, for computing the fast and slow parts of the dynamics,
i.e. for constructing the similarity transformation which puts the system
into the form of (1.3) (with $r = 2$). From this perspective we can view
the contribution of [1] as determining the existence of and constructing
such transformations when the time-scale behavior of the system is not
evident by inspection. The price that was paid in [2], however, was a
rather complex procedure involving nested projections and pseudo-inversions.

Our work [7], [9] in the past year has had as its goal the development
of a methodology for analyzing (1.1) which combines the generality of [1]
with the intuitive and algebraically simple results associated with models
such as (1.4). We have now done this, and not only does this provide a
very clear connection between our earlier work and the work of others, but
it also establishes an algebraic framework for examining numerous other
problems involving systems with multiple time scales.

The basis for our approach is to view $A(\epsilon)$ as a matrix over the
(local) ring $W$ of functions of $\epsilon$ that are analytic at $\epsilon = 0$. The matrix
$A(\epsilon)$ then has a Smith decomposition

$$A(\epsilon) = P(\epsilon)\,D(\epsilon)\,Q(\epsilon) \tag{1.5}$$

where $P(\epsilon)$ and $Q(\epsilon)$ are unimodular, i.e. $|P(0)| \neq 0$, $|Q(0)| \neq 0$, and

$$D(\epsilon) = \mathrm{diag}\left(\epsilon^{i_1} I_1,\, \ldots,\, \epsilon^{i_m} I_m\right) \tag{1.6}$$

where $0 \le i_1 < \cdots < i_m$, and the identity matrices $I_j$ may have different
dimensions. (We have assumed here that $A(\epsilon)$ is nonsingular on some interval
of the form $(0, \epsilon_0)$; in the more general case $D(\epsilon)$ would also include a zero
diagonal block.) The diagonal elements of $D(\epsilon)$ are the invariant factors of
$A(\epsilon)$.
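As described in [7], because of the unimodularity of $P(\epsilon)$, one can show
that a uniform approximation of $x(t)$ with the same time scale structure as
(1.1) is $P(0)z(t)$, where

$$\dot{z} = D(\epsilon)\,\bar{A}\,z, \qquad \bar{A} = Q(0)P(0) \tag{1.7}$$

Note that this system is in a form that is the natural generalization of
(1.4):

$$\begin{bmatrix} \dot{z}_1 \\ \dot{z}_2 \\ \vdots \\ \dot{z}_m \end{bmatrix} =
\begin{bmatrix}
\epsilon^{i_1}\bar{A}_{11} & \epsilon^{i_1}\bar{A}_{12} & \cdots & \epsilon^{i_1}\bar{A}_{1m} \\
\epsilon^{i_2}\bar{A}_{21} & \epsilon^{i_2}\bar{A}_{22} & \cdots & \epsilon^{i_2}\bar{A}_{2m} \\
\vdots & \vdots & & \vdots \\
\epsilon^{i_m}\bar{A}_{m1} & \epsilon^{i_m}\bar{A}_{m2} & \cdots & \epsilon^{i_m}\bar{A}_{mm}
\end{bmatrix}
\begin{bmatrix} z_1 \\ z_2 \\ \vdots \\ z_m \end{bmatrix} \tag{1.8}$$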


Consequently there is a relatively straightforward procedure, involving
successive Schur complements of $\bar{A} = Q(0)P(0)$, for determining if (1.8) (and hence
(1.1)) has well-behaved time scales, and for constructing the transformation
which brings (1.8) (and thus (1.1)) into diagonalized time-scale form (as
in (1.3)).
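To make the Schur-complement step concrete, the following is a minimal sketch for a scalar instance of the row-scaled form (1.8) with $m = 2$, $i_1 = 0$, $i_2 = 1$. The numerical entries and the helper names `eig2` and `two_time_scale` are illustrative assumptions, not taken from the report; the sketch only compares the exact eigenvalues with the fast rate $a_{11}$ and the slow rate $\epsilon\,(a_{22} - a_{21}a_{11}^{-1}a_{12})$.

```python
import math

def eig2(a, b, c, d):
    """Eigenvalues of [[a, b], [c, d]], assuming they are real."""
    tr, det = a + d, a * d - b * c
    disc = math.sqrt(tr * tr - 4.0 * det)
    return sorted([(tr - disc) / 2.0, (tr + disc) / 2.0])

def two_time_scale(a11, a12, a21, a22, eps):
    """Fast and slow rates predicted by one Schur-complement step."""
    fast = a11                              # O(1) dynamics
    slow = eps * (a22 - a21 * a12 / a11)    # eps times the Schur complement of a11
    return fast, slow

eps = 0.01
a11, a12, a21, a22 = -1.0, 1.0, 1.0, -2.0   # hypothetical numbers
# Full system matrix: first row O(1), second row scaled by eps.
exact = eig2(a11, a12, eps * a21, eps * a22)
fast, slow = two_time_scale(a11, a12, a21, a22, eps)
print("exact eigenvalues:", exact)
print("predicted fast/slow rates:", fast, slow)
```

For matrix blocks the same pattern would apply with `a11` replaced by an invertible block and the division by a matrix inverse; that generalization is not shown here.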

This result is, in our opinion, of great significance, as it makes
clear the essential algebraic nature of the problem of time-scale
decompositions. In particular, it establishes the fact that the invariant
factors of $A(\epsilon)$ determine the time scale structure of (1.1). This opens
the door to the detailed examination of numerous other problems, some of
which we have begun to examine and others which wait for the near future.
Some of these are as follows:

(1) Important issues in the design of feedback control for complex
systems are the way in which feedback couplings should be structured and the
effect such couplings have on overall system performance. Such effects
can be quite dramatic, as has been documented in numerous examples of control
designs based on reduced-order models (neglecting, for example, fast
dynamics) which lead to closed-loop instabilities or severe performance
degradations. A natural question to ask in the context we have described
is the effect of feedback on time-scale structure. From our results we
see that a key question is that of invariant factor assignment, i.e.
the changing of invariant factors (and hence time scales) by feedback.
We have obtained some important results in this area, but we defer
discussion of these to the following section.

(2) The key computational aspect of our time scale decomposition
procedure is the determination of the Smith decomposition (1.5), (1.6).
We have been able to relate this computation explicitly to the projections
and pseudo-inversions in our earlier method [2]. Procedures exist for
the computation of Smith forms,† and we have begun to examine the
implementation of such algorithms for the numerical computation of time
scale decompositions of systems of the form of (1.1).

† P. Van Dooren, P. Dewilde, and J. Vandewalle, "On the Determination of
the Smith-McMillan Form of a Rational Matrix from Its Laurent Expansion,"
IEEE Trans. Circuits and Systems, March 1979.

G. Verghese and T. Kailath, "Rational Matrix Structure," IEEE Trans. Aut.
Control, Vol. AC-26, No. 2, pp. 434-439, April 1981.


(3) The form (1.5), (1.6) provides us with the basis for answering
another question we had posed in our proposal for this project. Specifically,
consider the problem of characterizing matrices $A_1(\epsilon)$ and $A_2(\epsilon)$ that have
identical time-scale behavior, i.e. they both have the same approximation
in (1.2), so that the difference $A_2(\epsilon) - A_1(\epsilon)$ is a regular perturbation
of $A_1(\epsilon)$. Clearly one necessary condition is

$$D_1(\epsilon) = D_2(\epsilon) \tag{1.9}$$

This, together with the condition

$$P_1(0) = P_2(0), \qquad Q_1(0) = Q_2(0)$$

forms a set of sufficient conditions, but the latter is not necessary because
of the nonuniqueness of $P$ and $Q$ in the Smith decomposition. More generally
one could allow

$$P_1(0) = P_2(0)\,R, \qquad Q_1(0) = R^{-1}\,Q_2(0) \tag{1.10}$$

where $R$ is a block-diagonal similarity transformation with blocks of sizes
equal to those in $D(\epsilon)$, but even this is not a necessary condition, since
one can essentially add slow modes to faster ones without affecting asymptotic
behavior. For example,


$$A_1(\epsilon) = \begin{bmatrix} -1 & 0 \\ 0 & -\epsilon \end{bmatrix}$$


Ve) 


1  1  -1  0 


-1  -£ 




have the property that

$$\lim_{\epsilon \to 0}\ \sup_{t \ge 0}\ \left\| e^{A_1(\epsilon)t} - e^{A_2(\epsilon)t} \right\| = 0$$

This suggests that upper-block diagonal transformations (with identities
along the diagonal) on the right of $P$ do not affect asymptotic equivalence,
and the same is true of lower-block diagonal transformations (with identities
along the diagonal) on the left of $Q$. We are presently completing the
verification of necessary and sufficient conditions along these lines (and
also allowing transformations as in (1.10)) for our notion of asymptotic
equivalence.
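The notion of asymptotic equivalence used in item (3) can be checked numerically on a small case. The sketch below takes $A_1(\epsilon) = \mathrm{diag}(-1, -\epsilon)$ and, as an assumption made purely for illustration, a second matrix in which the slow state enters the fast equation with a gain of order $\epsilon$; this second matrix is not claimed to be the report's $A_2(\epsilon)$. The function names and the grid parameters are likewise hypothetical.

```python
import math

def expm_upper(a, b, d, t):
    """exp(t * [[a, b], [0, d]]) for a != d, in closed form."""
    ea, ed = math.exp(a * t), math.exp(d * t)
    return [[ea, b * (ea - ed) / (a - d)], [0.0, ed]]

def sup_gap(eps, t_max_factor=20.0, n=20000):
    """Grid estimate of sup over t of the largest entry of |exp(A1 t) - exp(A2 t)|."""
    worst = 0.0
    t_max = t_max_factor / eps                  # long enough to cover the slow mode
    for k in range(n + 1):
        t = t_max * k / n
        e1 = expm_upper(-1.0, 0.0, -eps, t)     # A1 = diag(-1, -eps)
        e2 = expm_upper(-1.0, -eps, -eps, t)    # assumed A2 = [[-1, -eps], [0, -eps]]
        gap = max(abs(e1[i][j] - e2[i][j]) for i in range(2) for j in range(2))
        worst = max(worst, gap)
    return worst

for eps in (0.1, 0.01):
    print(eps, sup_gap(eps))
```

In this assumed example the only differing entry is bounded by $\epsilon/(1-\epsilon)$ for all $t \ge 0$, so the estimated gap shrinks with $\epsilon$, which is the behavior the limit above describes.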

(4) The fact that our method yields an algebraic connection between
(1.1) and the explicit form of (1.7) provides us with a basis for evaluating
bounds on convergence rates of time-scale approximations. Specifically,
we have begun to investigate the construction of such bounds in terms of
$\|P(\epsilon) - P(0)\|$, $\|Q(\epsilon) - Q(0)\|$, and the Schur complement structure of
$Q(0)P(0)$. The work described in (6) to follow on higher order corrections
to the time-scale approximation (see (1.19) - (1.22)) will also be of
value here in providing a method for pinpointing the lead-order error
terms.

(5) Points (3) and (4) above are extremely important in shedding light
on theoretical problems in practical time-scale decomposition. In particular,
it is rarely the case that a system is given in the form (1.1) with
a small parameter identified. Rather, one is interested in starting with
a system in the form

$$\dot{x}(t) = F\,x(t) \tag{1.11}$$

identifying a small number $\epsilon$ and writing

$$F = \sum_{i \ge 0} B_i\,\epsilon^i \tag{1.12}$$

so that the resulting time-scale approximation based on this representation
is accurate to within some prescribed tolerance. This problem raises
numerous issues. For example, which small elements of $F$ should be viewed
as order $1$, as order $\epsilon$, as order $\epsilon^2$, etc.? Intuitively one must have that
$\|P(\epsilon) - P(0)\|$ is sufficiently smaller than the minimum singular value
of $P(0)$ (similarly for $Q(\epsilon)$ and $Q(0)$) so that it is fair to view the
difference $P(\epsilon) - P(0)$ as a negligible (i.e. regular) perturbation of $F$
in the sense of (3). We plan to build on such insights in order to
develop a constructive procedure for determining accurate time scale
decompositions of systems as in (1.11).
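The question of which entries of $F$ to treat as order $1$, $\epsilon$, $\epsilon^2$ can be illustrated with a deliberately naive binning rule. The sketch below is an assumption-laden illustration only: it assigns each nonzero entry to the nearest integer power of a user-chosen $\epsilon$ on a logarithmic scale, which is not the constructive procedure the report plans to develop and ignores the conditions on $P(\epsilon)$ and $Q(\epsilon)$ discussed above. The function name `split_by_order` and the sample matrix are hypothetical.

```python
import math

def split_by_order(F, eps):
    """Write F = sum_i B_i * eps**i by sending each entry to the nearest power of eps."""
    terms = {}
    n_rows, n_cols = len(F), len(F[0])
    for r in range(n_rows):
        for c in range(n_cols):
            f = F[r][c]
            if f == 0.0:
                continue
            # Nearest non-negative integer exponent on a log scale.
            order = max(0, round(math.log(abs(f)) / math.log(eps)))
            B = terms.setdefault(order, [[0.0] * n_cols for _ in range(n_rows)])
            B[r][c] = f / eps ** order
    return terms

F = [[-1.2, 0.9], [0.011, -0.008]]     # hypothetical system matrix
for order, B in sorted(split_by_order(F, 0.01).items()):
    print("B_%d =" % order, B)
```

By construction the terms sum back to $F$ exactly (up to rounding); the open issue raised in the text is whether the resulting time-scale approximation meets a prescribed tolerance, which this sketch does not address.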


(6) There are numerous ties to other research areas (almost-invariant
subspaces and implicit systems, i.e. systems of the form
$E\dot{x} = Ax$, to name two) which we feel can be illuminated significantly
using the algebraic framework we have developed.

(7) We have also begun to use the results obtained to date to
examine more closely the conditions $A(\epsilon)$ must satisfy in order for there
to exist an approximation as in (1.2). In particular, there are two ways
in which a system can fail to satisfy the conditions. Either $A(\epsilon)$ violates
the so-called multiple semisimple nullstructure (MSSNS) condition
[2] or it satisfies this but violates the multiple semistability
condition (MSST). If the first of these is violated it means that in
the procedure for constructing the $A_i$ in (1.2) (either as in [1] or via
Schur complementation as in [7], [9]) one encounters one of these matrices
which has a Jordan block of size greater than $1 \times 1$ corresponding to the
zero eigenvalue. For example,


$$A(\epsilon) = \begin{bmatrix} -\epsilon & 1 \\ -\epsilon & -\epsilon \end{bmatrix} \tag{1.13}$$


is asymptotically stable for any $\epsilon > 0$ but it does not have a uniform
approximation as in (1.2). Note that the Jordan form of $A(\epsilon)$ changes
abruptly at $\epsilon = 0$, which indicates that there must exist a singularity
at $\epsilon = 0$ in the similarity transformation that brings $A(\epsilon)$ to Jordan
form. For this reason we have begun to study alternative canonical forms
of $A(\epsilon)$ and in particular the relationship between the Smith decomposition
and the eigenstructure of $A(\epsilon)$. A conjecture which we are in the process
of examining is that the eigenvalues of $A(\epsilon)$ are of orders of $\epsilon$ exactly
equal to the invariant factors if and only if $A(\epsilon)$ has MSSNS. This fact,
if true, will also be of great value in several control problems we are
considering.

The point we have just made, that a violation of the MSSNS condition
corresponds to singularities in the similarity transformation that
brings $A(\epsilon)$ to Jordan form, suggests another related problem on which
we have made progress recently. Specifically this problem is concerned
with using $\epsilon$-dependent similarity transformations of $A(\epsilon)$ which are singular
at $0$ and cancel the singularities in the Jordan similarity transformation.
That is, if $A(\epsilon)$ does not have MSSNS, we are concerned with constructing
a transformation $T(\epsilon)$ so that

$$\tilde{A}(\epsilon) = T(\epsilon)\,A(\epsilon)\,T^{-1}(\epsilon) \tag{1.14}$$

does have MSSNS. What this in essence does is identify those components
of $x(t)$ which require scaling, an idea which has been used by Sannuti
and Wason*. Again our work to date indicates that the algebraic approach
we have developed provides precisely the correct framework for answering
this question in the simplest and most illuminating fashion. As an
example, consider


$$A(\epsilon) = \begin{bmatrix} -\epsilon & 1 \\ 0 & -\epsilon \end{bmatrix} \tag{1.15}$$


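If we scale the state

$$\tilde{x} = T(\epsilon)\,x = \begin{bmatrix} \epsilon & 0 \\ 0 & 1 \end{bmatrix} x \tag{1.16}$$

we find that

$$\tilde{A}(\epsilon) = \begin{bmatrix} -\epsilon & \epsilon \\ 0 & -\epsilon \end{bmatrix} \tag{1.17}$$

which does satisfy the MSSNS condition. Note that the invariant factors
of $A(\epsilon)$ are $1$ and $\epsilon^2$, while both invariant factors of $\tilde{A}(\epsilon)$ are $\epsilon$,
indicating that this system has only one time scale.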
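The invariant-factor statement for this example can be checked mechanically. The sketch below is a minimal illustration for nonsingular $2 \times 2$ matrices whose entries are polynomials in $\epsilon$ (stored as coefficient lists, index = power of $\epsilon$): over the ring of functions analytic at $\epsilon = 0$, the first invariant factor has the lowest $\epsilon$-order among the entries, and the product of both has the order of the determinant. The unscaled matrix $[[-\epsilon, 1], [0, -\epsilon]]$ is an assumption here, taken as the matrix that the scaling (1.16) maps to (1.17); the helper names are hypothetical.

```python
def order(p):
    """Lowest power of eps with a nonzero coefficient (None for the zero polynomial)."""
    for k, coeff in enumerate(p):
        if coeff != 0:
            return k
    return None

def poly_mul(p, q):
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_sub(p, q):
    n = max(len(p), len(q))
    p = p + [0] * (n - len(p))
    q = q + [0] * (n - len(q))
    return [a - b for a, b in zip(p, q)]

def invariant_orders_2x2(M):
    """Exponents (k1, k2) such that the invariant factors are eps**k1 and eps**k2."""
    entry_orders = [order(p) for row in M for p in row if order(p) is not None]
    k1 = min(entry_orders)                       # order of the gcd of the entries
    det = poly_sub(poly_mul(M[0][0], M[1][1]), poly_mul(M[0][1], M[1][0]))
    return k1, order(det) - k1                   # determinant order = k1 + k2

A = [[[0, -1], [1]], [[0], [0, -1]]]             # assumed [[-eps, 1], [0, -eps]]
A_scaled = [[[0, -1], [0, 1]], [[0], [0, -1]]]   # [[-eps, eps], [0, -eps]] as in (1.17)
print(invariant_orders_2x2(A), invariant_orders_2x2(A_scaled))
```

The two printed pairs correspond to invariant factors $(1, \epsilon^2)$ before scaling and $(\epsilon, \epsilon)$ after scaling, matching the statement in the text.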

In  [11]  we  have  considered  an  alternative  approach  to  the  problem 
of  deriving  approximations  for  a  particular  class  of  systems  which 
violate  the  MSSNS  condition.  In  particular  in  this  paper  we  examine  a 
specific  model  structure  commonly  used  in  analyzing  interconnected 
power  systems.  Specifically  we  have  considered  models  of  the  form 

* P. Sannuti and H. Wason, "Singular Perturbation Analysis of Cheap Control
Problems," Proc. 22nd IEEE Conf. on Decision and Control, San Antonio,
Texas, Dec. 1983.

P. Sannuti and H. Wason, Int. J. Contr., Vol. 37, 1983, pp. 1259-1286.


where $F(\epsilon)$ is an infinitesimal stochastic matrix. Because of this, $F(\epsilon)$
has a fixed zero eigenvalue and thus $A(0)$ doesn't have semisimple null
structure. However, using the results in [2] on aggregation of finite-state
Markov processes we are able to derive an approximation which in
essence corresponds to keeping the dominant term in each element of
$\exp\{A(\epsilon)t\}$. For the class of systems considered in [11] this can be
accomplished in a relatively simple and intuitively appealing fashion.
We are at present considering generalizations to other systems, and the
development of a precise definition of the way in which one should think
of this approximation as being good. For example, $\hat{x}(t) = e^{t}$ is a "good"
approximation of $x(t) = e^{(1+\epsilon)t}$ in the sense that the coefficient
multiplying $t$ in the exponent of

$$\frac{\hat{x}(t)}{x(t)}$$

is of strictly higher order in $\epsilon$ (which is not true of $e^{(0.9)t}$ or
$e^{(1.1)t}$, for example).

The second way in which a system can fail to have an approximation
as in (1.2) is if it satisfies the MSSNS condition but not the MSST
condition. This could happen for one of two reasons. One possibility
is that there are unstable poles such as $(\epsilon + \epsilon^2)$. The leading term
approximation described in the preceding paragraph is aimed at such a
situation. The other possibility is that $A(\epsilon)$ is stable for $\epsilon > 0$
but some of the eigenvalues of one of the $A_i$ in (1.2) are purely
imaginary. This corresponds to a situation in which the rate of
oscillation in a complex mode is of lower order (and hence faster) than
the damping. Consider for example


$$A(\epsilon) = \begin{bmatrix} -\epsilon & 1 \\ -1 & -\epsilon \end{bmatrix} \tag{1.18}$$

which yields responses of the form $e^{-\epsilon t}\sin t$.

What we are considering in such cases is the inclusion of higher-order
terms in the asymptotic expansion, or, equivalently, allowing the
$A_i$ in (1.2) to violate the condition $A_i A_j = 0$. To see how such a
decomposition might be obtained, consider $y(t) = P^{-1}(\epsilon)x(t)$. Then

$$\dot{y}(t) = D(\epsilon)\,\bar{A}(\epsilon)\,y(t), \qquad \bar{A}(\epsilon) = Q(\epsilon)P(\epsilon) \tag{1.19}$$

Compare this to the process $z(t)$ defined in (1.7). If we define the
"correction process" (with $\bar{A} = Q(0)P(0)$ as in (1.7))

$$w(t) = e^{-D(\epsilon)\bar{A}t}\,y(t) \tag{1.20}$$

we find that

$$\dot{w}(t) = \left[-D(\epsilon)\bar{A} + e^{-D(\epsilon)\bar{A}t}\,D(\epsilon)\bar{A}(\epsilon)\,e^{D(\epsilon)\bar{A}t}\right] w(t) \tag{1.21}$$

and an investigation of the structure of the matrix in (1.21) should
identify the desired higher-order corrections. As a very simple example,
consider again (1.18). In this case


w  <  t) 


w(t) 


(1.22) 
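As a hedged worked instance of (1.19) - (1.21), suppose (1.18) is $A(\epsilon) = A_0 - \epsilon I$ with $A_0$ the rotation generator below, which reproduces the $e^{-\epsilon t}\sin t$ responses mentioned above. Since $\det A(0) \neq 0$, one admissible Smith decomposition (an assumption made here for illustration, not necessarily the choice used in the report) takes $P(\epsilon) = A(\epsilon)$ and $D(\epsilon) = Q(\epsilon) = I$. Because $A_0$ commutes with its own exponential, the correction dynamics then collapse to a pure decay at rate $\epsilon$:

```latex
\begin{align*}
A(\epsilon) &= A_0 - \epsilon I, \qquad
A_0 = \begin{bmatrix} 0 & 1 \\ -1 & 0 \end{bmatrix}, \qquad \det A(0) = 1 \neq 0 \\
P(\epsilon) &= A(\epsilon), \quad D(\epsilon) = I, \quad Q(\epsilon) = I
  \;\Longrightarrow\;
  \bar{A} = Q(0)P(0) = A_0, \quad
  \bar{A}(\epsilon) = Q(\epsilon)P(\epsilon) = A_0 - \epsilon I \\
\dot{w}(t) &= \left[ -A_0 + e^{-A_0 t}\,(A_0 - \epsilon I)\,e^{A_0 t} \right] w(t)
            = -\epsilon\, w(t) \\
w(t) &= e^{-\epsilon t}\, w(0), \qquad
y(t) = e^{A_0 t}\, w(t) = e^{-\epsilon t}\, e^{A_0 t}\, y(0)
\end{align*}
```

Under these assumptions the fast oscillation is carried by $e^{A_0 t}$ and the slow damping appears only in the correction process, which is the separation the higher-order terms are meant to capture.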


Our present work along the lines just described is quite close to
providing a general procedure for approximating dynamics of the form
of (1.1) which violate the conditions for (1.2) to exist. Such a procedure
would involve $\epsilon$-dependent similarity transformations to obtain a
transformed system which satisfies the MSSNS condition, leading-order
approximations for unstable modes, and higher-order corrections for systems
which satisfy MSSNS but have complex poles with real parts of higher order
than imaginary parts.

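As an example of a system which requires two of these steps, consider
again $A(\epsilon)$ in (1.13). If we scale $A(\epsilon)$ as in (1.14) with

$$T(\epsilon) = \begin{bmatrix} \epsilon^{1/2} & 0 \\ 0 & 1 \end{bmatrix} \tag{1.23}$$

we obtain

$$\tilde{A}(\epsilon) = \begin{bmatrix} -\epsilon & \epsilon^{1/2} \\ -\epsilon^{1/2} & -\epsilon \end{bmatrix} \tag{1.24}$$

which is essentially of the same form as in (1.18) (one need only identify
$\epsilon^{1/2}$ as the fundamental parameter rather than $\epsilon$ and perform a simple
time scaling).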
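The scaling step can be verified numerically. The sketch below assumes the unscaled matrix $A(\epsilon) = [[-\epsilon, 1], [-\epsilon, -\epsilon]]$, chosen because it is consistent with the scaling (1.23) and the scaled form (1.24); it applies a diagonal similarity transformation and computes the eigenvalues, which are unchanged by the scaling. The function names and the value of $\epsilon$ are illustrative.

```python
import cmath

def scale(A, T_diag):
    """Return T A T^{-1} for a diagonal T given by its diagonal entries."""
    n = len(A)
    return [[T_diag[i] * A[i][j] / T_diag[j] for j in range(n)] for i in range(n)]

def eig2(M):
    """Eigenvalues of a 2x2 matrix, allowing complex results."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return (tr + root) / 2, (tr - root) / 2

eps = 1e-4
A = [[-eps, 1.0], [-eps, -eps]]           # assumed unscaled matrix
A_scaled = scale(A, [eps ** 0.5, 1.0])    # T = diag(eps^(1/2), 1)
lam = eig2(A_scaled)[0]
print(A_scaled)
print(lam)   # real part of order eps, imaginary part of order eps^(1/2)
```

The off-diagonal entries of the scaled matrix are $\pm\epsilon^{1/2}$ and the eigenvalues are $-\epsilon \pm i\,\epsilon^{1/2}$, so the oscillation is of lower order in $\epsilon$ than the damping, as in the discussion of (1.18).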
II. Control and Estimation

Our research in this area has had two major thrusts. The first of these
builds directly on the tools described in the preceding section. Specifically
we have focused attention on an examination of systems of the form

$$\dot{x}(t) = A(\epsilon)\,x(t) + B(\epsilon)\,u(t) \tag{2.1}$$

$$y(t) = C(\epsilon)\,x(t) \tag{2.2}$$

Our ultimate aim is to develop a complete picture of how time-scales, weak
couplings, and differences in the scales of controllability and observability
of various components of the state and in the weightings of states and controls
in the system design criterion interact in determining the structure of
control designs. Our goal is to develop constructive procedures for designing
hierarchical or decentralized control systems which take into account these
scaling differences to achieve nearly optimal performance.

Notable contributions have been made on various aspects of this subject,
but much remains to be done. Our results to date indicate that the algebraic
approach outlined in the preceding section provides an excellent framework
for examining this subject, for obtaining results which extend considerably
what is known at present, and for shedding substantial light on the nature
of problems of this type by uncovering and explicitly examining the critical
mathematical constructs which form the heart of these problems. For example,
as indicated in the preceding section, our results have shown that the
invariant factors of $A(\epsilon)$ determine the open-loop time scales of (2.1),
assuming that the MSSNS condition is satisfied (if it is not, some scaling
must be performed as described in Section I). Consequently a natural
question is precisely how the invariant factors of
(2.1) can be modified by feedback of the form

$$u(t) = K(\epsilon)\,x(t) \tag{2.3}$$

(where  K(£)  is  again  a  matrix  over  the  ring  of  functions  analytic  at  £=0)  . 
By allowing $\epsilon$-dependence in (2.3) we are in essence considering the
question of feedback structure. That is, the matrix $K(0)$ determines which
states are strongly coupled to which controls, the matrix

$$\frac{K(\epsilon) - K(0)}{\epsilon}$$

determines the next level of coupling in the hierarchy, etc.

At this time, we have obtained important results on invariant factor
assignment [7], [9]. In particular if $A(\epsilon)$ and $B(\epsilon)$ are left coprime,
i.e., if $[A(0) : B(0)]$ has full row rank, then the closed-loop system matrix

$$F(\epsilon) = A(\epsilon) + B(\epsilon)\,K(\epsilon) \tag{2.4}$$

can have no more than $b = \operatorname{rank} B(0)$ non-unit invariant factors, and these
$b$ factors can be made to equal an arbitrary set $\epsilon^{j_1}, \ldots, \epsilon^{j_b}$ (with
$\epsilon^{0} = 1$, $\epsilon^{\infty} = 0$) by an appropriate choice of $K(\epsilon)$ which we can
construct explicitly.

This  result  opens  the  way  for  the  consideration  of  numerous  other  problems: 

(1) Precisely how can the eigenvectors of $F(\epsilon)$ be controlled as well
as the invariant factors? That is, how can we influence which states
evolve at which time scales?

(2) Can $K(\epsilon)$ be chosen so that desired invariant factors are achieved
and $F(\epsilon)$ has MSSNS? If not, characterize the required scaling of $F(\epsilon)$.

(3) Note that if $b = \operatorname{rank} B(0) < m = \operatorname{rank} B(\epsilon)$ for $\epsilon > 0$, our result
indicates that fewer time scales can be affected than we have independent
controls. In such a case, some of the controls are uniformly weak, and the
only way in which time scales could be influenced in general is by high
gain, i.e. by allowing terms of the form $1/\epsilon$ in $K(\epsilon)$ or equivalently by
allowing input scaling

$$u(t) = S(\epsilon)\,\bar{u}(t) \tag{2.5}$$

so that

$$\dot{x}(t) = A(\epsilon)\,x(t) + \bar{B}(\epsilon)\,\bar{u}(t) \tag{2.6}$$

where $\bar{B}(\epsilon) = B(\epsilon)\,S(\epsilon)$ is still analytic at $\epsilon = 0$ and has the property that
$\bar{B}(0)$ is of full rank. For example, the time scale of

$$\dot{x} = -x + \epsilon u \tag{2.7}$$

can be changed by using feedback of the form

$$u = \left(\frac{1}{\epsilon} + K(\epsilon)\right) x \tag{2.8}$$

(4) Another important problem is the case when $[A(\epsilon) : B(\epsilon)]$ does not
have full row rank. In this case, there are two avenues of investigation.
The first of these involves the use of scaling of the inputs and possibly
the states to achieve the coprime and $B(0)$ full rank conditions. In our
other approach, we suppose that we are restricted to using $K(\epsilon)$ which are
analytic at $\epsilon = 0$ (and thus perform no input scaling). In this case $F(\epsilon)$
is of the form

$$F(\epsilon) = W(\epsilon)\,\tilde{F}(\epsilon), \qquad \tilde{F}(\epsilon) = \tilde{A}(\epsilon) + \tilde{B}(\epsilon)\,K(\epsilon) \tag{2.9}$$

where $W(\epsilon)$ is a greatest common left divisor of $A(\epsilon)$, $B(\epsilon)$, and $\tilde{A}(\epsilon)$, $\tilde{B}(\epsilon)$
are coprime. If the invariant factors of $F(\epsilon)$, $W(\epsilon)$, and $\tilde{F}(\epsilon)$ are denoted
by $f_i(\epsilon)$, $w_i(\epsilon)$, and $\tilde{f}_i(\epsilon)$ and are ordered such that the $i$th one divides
the $(i+1)$-th one, we have that

$$w_i(\epsilon) \mid f_i(\epsilon) \quad \text{and} \quad \tilde{f}_i(\epsilon) \mid f_i(\epsilon) \tag{2.10}$$

The first condition shows that every invariant factor of $F(\epsilon)$ must contain
the corresponding invariant factor of $W(\epsilon)$. The $\tilde{f}_i(\epsilon)$ are governed by our
result in the coprime case, and thus some conclusions about the $f_i(\epsilon)$ can
be drawn from the second divisibility condition in (2.10). Note that this
does not provide a complete solution, and open questions remain. In
particular, in [9] we present a result on one set of conditions under which
$f_i(\epsilon) = w_i(\epsilon)\,\tilde{f}_i(\epsilon)$. Work on more complete characterizations of $f_i(\epsilon)$ in other
cases is continuing.

(5) There are close ties between our work and several other research
areas which we have begun to explore and develop. In particular, questions
such as (1) are related to the much broader subject of the geometric
structure of (2.1), which in turn has ties to the work on almost (A,B)
invariant subspaces of Willems*. In our case we have additional structure,
however, provided by the various scales defined by increasing orders in
$\epsilon$. Also, just as the work of Willems has close ties to the topic of high
gain feedback, so does our work, and we plan to explore this avenue. In
particular we have begun to examine the interpretation and extension of the
approach of Sannuti and Wason (referenced earlier) to our framework. As
a final point, we note that it is certainly possible to consider choices
of input scaling so that the scaled input matrix $B(\epsilon)S(\epsilon)$ has singularities
at $\epsilon = 0$. This appears at least cosmetically to be more closely tied to work
such as that on cheap control and high-gain feedback. However, if the scaled
input matrix contains terms of the form $1/\epsilon^n$, a simple time scaling (so that
the fastest time scale is the "new" time variable $t$) removes these
singularities. Consequently our investigation will allow us to consider high
gain by a single time scale identification.

* J.C. Willems, "Almost Invariant Subspaces: An Approach to High Gain
Feedback Design - Part I: Almost Controlled Invariant Subspaces,"
IEEE Trans. on Aut. Control, Vol. AC-26, 1982, pp. 235-252.
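
As a brief worked illustration of the high-gain example in item (3), suppose
the feedback applied to the scalar system $\dot{x} = -x + \epsilon u$ of (2.7) takes
the form $u = (1/\epsilon + K(\epsilon))x$ with $K(\epsilon)$ analytic at $\epsilon = 0$. The $1/\epsilon$
coefficient is an assumption about the intended form of (2.8), chosen to be
consistent with the statement that terms of the form $1/\epsilon$ are allowed.
Substituting gives

```latex
\begin{align*}
\dot{x} &= -x + \epsilon\left(\frac{1}{\epsilon} + K(\epsilon)\right)x \\
        &= -x + x + \epsilon K(\epsilon)\,x \\
        &= \epsilon K(\epsilon)\,x .
\end{align*}
```

Under this assumption the closed-loop state evolves on a time scale of order
$1/\epsilon$ or slower, depending on the leading power of $\epsilon$ in $K(\epsilon)$. By
contrast, feedback that is analytic at $\epsilon = 0$ gives
$\dot{x} = (-1 + \epsilon K(\epsilon))x$, whose time scale remains of order one.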

Another area [9] in which we have begun work is in the examination of a
generalization of the cheap control problem which allows the open-loop system
to have several time scales and allows differences in the scales at which
different states and controls are weighted. Specifically, consider the problem
of choosing a control law for (2.1) to minimize

$$J = \int_0^\infty \left[ x'(t)\,Q(\epsilon)\,x(t) + u'(t)\,R(\epsilon)\,u(t) \right] dt \tag{2.11}$$

The Hamiltonian matrix for this problem is

$$H(\epsilon) = \begin{bmatrix} A(\epsilon) & -B(\epsilon)\,R^{-1}(\epsilon)\,B'(\epsilon) \\ -Q(\epsilon) & -A'(\epsilon) \end{bmatrix} \tag{2.12}$$


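Define $\Theta_F(\epsilon)$ and $\Theta_B(\epsilon)$ as the positive definite solutions of the algebraic
Riccati equations

$$\Theta_F(\epsilon)A(\epsilon) + A'(\epsilon)\Theta_F(\epsilon) - \Theta_F(\epsilon)B(\epsilon)R^{-1}(\epsilon)B'(\epsilon)\Theta_F(\epsilon) + Q(\epsilon) = 0 \tag{2.13a}$$

$$\Theta_B(\epsilon)A(\epsilon) + A'(\epsilon)\Theta_B(\epsilon) + \Theta_B(\epsilon)B(\epsilon)R^{-1}(\epsilon)B'(\epsilon)\Theta_B(\epsilon) - Q(\epsilon) = 0 \tag{2.13b}$$

Then one can construct a similarity transformation

$$T(\epsilon) = \begin{bmatrix} \Theta_F(\epsilon) & -I \\ \Theta_B(\epsilon) & I \end{bmatrix} \tag{2.14}$$

operating on $H(\epsilon)$ to yield

$$T(\epsilon)\,H(\epsilon)\,T^{-1}(\epsilon) = \begin{bmatrix} -A'(\epsilon) + \Theta_F(\epsilon)B(\epsilon)R^{-1}(\epsilon)B'(\epsilon) & 0 \\ 0 & -A'(\epsilon) - \Theta_B(\epsilon)B(\epsilon)R^{-1}(\epsilon)B'(\epsilon) \end{bmatrix} \tag{2.15}$$
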


which indicates the well-known result that the eigenvalues of $H(\epsilon)$ are also
those of the optimal closed-loop system. This suggests that if $H(\epsilon)$ has
MSSNS, invariant factor analysis of $H(\epsilon)$ may yield the time-scale structure
of the closed-loop system. However, the similarity transformation bringing
$H(\epsilon)$ to the form (2.15) is unimodular if and only if $\Theta_F(\epsilon) + \Theta_B(\epsilon)$ is
unimodular, which will not be the case in nearly singular control problems.
This obviously points to the need for scaling and to the roles of $\Theta_F(\epsilon)$,
$\Theta_B(\epsilon)$, and $H(\epsilon)$ in determining the requisite scaling and the resulting time
scale structure. Sannuti and Wason have investigated this point in the
special case in which $R(\epsilon) = \epsilon R$ is the only $\epsilon$-dependence (see also the
closely related and important work of Hautus and Silverman†). We are now
involved in examination of the general problem we have posed using the
algebraic framework we have developed, and the extension of this problem
to include $\epsilon$-dependent observations in order to achieve our objective of
developing a complete picture of the interplays among scales on open-loop
dynamics, control effectiveness, observability, and weightings on inputs
and states.
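
The block-diagonalization in (2.14)-(2.15) and the loss of unimodularity in
nearly singular problems can be checked numerically in the scalar case. The
following is a minimal sketch, not the method of [9]: it assumes scalar
$A = a$, $B = b$, $Q = q$, $R = r$, takes the transformation to have rows
$[\Theta_F, -1]$ and $[\Theta_B, 1]$, and uses illustrative function names and
parameter values that do not come from the report.

```python
import math


def riccati_solutions(a, b, q, r):
    """Positive roots of the scalar forms of (2.13a) and (2.13b)."""
    s = b * b / r                      # scalar B R^{-1} B'
    root = math.sqrt(a * a + s * q)
    theta_f = (a + root) / s           # solves 2*a*th - s*th**2 + q = 0
    theta_b = (-a + root) / s          # solves 2*a*th + s*th**2 - q = 0
    return theta_f, theta_b


def matmul2(x, y):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def transformed_hamiltonian(a, b, q, r):
    """Return (T H T^{-1}, det T) for the scalar problem."""
    s = b * b / r
    theta_f, theta_b = riccati_solutions(a, b, q, r)
    h = [[a, -s], [-q, -a]]                      # scalar form of (2.12)
    t = [[theta_f, -1.0], [theta_b, 1.0]]        # assumed form of (2.14)
    det_t = theta_f + theta_b                    # determinant of T
    t_inv = [[1.0 / det_t, 1.0 / det_t],
             [-theta_b / det_t, theta_f / det_t]]
    return matmul2(matmul2(t, h), t_inv), det_t


if __name__ == "__main__":
    # Cheap-control style weighting r = eps (an illustrative choice).
    for eps in (1.0, 1e-2, 1e-4):
        d, det_t = transformed_hamiltonian(a=-1.0, b=1.0, q=1.0, r=eps)
        print(eps, d, det_t)
```

In this scalar setting the off-diagonal entries vanish up to rounding, and
$\det T = \Theta_F + \Theta_B$ shrinks toward zero as the control weighting becomes
small, which is the scalar analogue of the transformation failing to be
unimodular.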

We  have  also  made  progress  in  our  research  involving  estimation  of 
finite-state  Markov  processes  (FSMP's)  possessing  several  time  scales. 

The  basis  for  this  research  is  the  methodology  developed  in  [2]  which 
uses  our  results  on  decomposing  systems  of  the  form  (1.1)  and  the  basic 
properties  of  FSMP's  to  construct  a  hierarchy  of  simpler,  aggregated  models 
of  FSMP's  which  contain  rare  transitions.  Each  model  ignores  transitions 
that  occur  at  a  time  scale  far  greater  than  the  one  with  which  the  model 
is concerned and aggregates the effects of transitions that occur at faster
scales. The existence of such a hierarchy suggests the use of estimator
structures which take advantage of such a decomposition of the underlying
process, thereby offering the possibility of reducing extremely complex
estimation problems to sets of far simpler ones.

† M.L.J. Hautus and L.M. Silverman, "System Structure and Singular Control,"
Linear Algebra & Applications, Vol. 50, pp. 369-402, 1983.

Our research in this area has consisted of two distinct pieces.
On the one hand we have made progress in performing detailed asymptotic
analyses of very simple singularly perturbed FSMP estimation problems [12],
and this work has produced both several important insights into what types
of performance measures are important for such estimators and an analytical
approach for calculating asymptotic approximations to such measures. The
other portion of our work [8], [10] has dealt directly with a class of FSMP's
of great complexity but which also possess important structural features.
Our objective in this area has been to develop estimator structures that
take direct advantage of this structure. By doing so, it has been our
hope to uncover important principles and concepts that could then be used
both for designing estimators for other classes of problems and for suggesting
promising and important theoretical directions.

The  research  described  in  [10]  has  as  its  motivation  the  automated 
analysis  of  electrocardiograms  (ECG's).  Our  reason  for  choosing  this 
focus  is  not  only  that  ECG  analysis  is  an  important  and  challenging  problem 
but  also  that  it  is  necessary  to  establish  a  context  for  an  investigation 
of  this  type.  The  class  of  "large  and  complex  FSMP's"  is  far  too  amorphous 
to  yield  interesting  insights  and  analysis;  what  is  needed  is  to  define 
a  structured  class  of  FSMP's  with  clear  estimation  objectives.  Thus  an 
accurate  statement  is  that  ECG  analysis  has  guided  the  choices  of  estimation 
structures  and  problems  we  are  investigating,  but  that  the  class  of  problems 
we are considering is by no means restricted to ECG analysis and includes
numerous other complex signal analysis problems as well as topics such as
multitarget tracking and complex queueing networks.

To  be  more  specific,  the  problems  we  have  been  analyzing  are  hybrid  in 
nature  —  that  is,  they  involve  both  discrete-  and  continuous-valued 
processes,  where  one  can  think  of  sequences  of  discrete  states  as  events 
which  influence  the  observed  continuous  waveforms.  In  particular,  the  type 
of  model  that  we  are  considering  consists  of  an  interconnection  of  discrete- 
state  processes  where  the  state  of  one  process  can  influence  the  transition 
rates  in  the  other  processes  (as  we  will  point  out  shortly,  this  is  precisely 
how  one  can  interpret  the  results  of  [2]),  and  particular  transitions  in 
some  of  these  processes  initiate  the  generation  of  continuous  waveforms. 

The actual observation is the superposition of the continuous waveforms
that have been generated (just as the ECG is the superposition of the
measured electrical activity of the various regions of the heart).

In [8], [10] a methodology is developed for modeling cardiac activity
and  in  particular  its  effect  on  the  observed  ECG  using  models  of  this  type. 
These  models  have  several  very  important  aspects.  Two  of  these  are  timing 
and  control.  The  issue  of  control  is  related  to  the  fact  that  the 
electrical  state  of  one  portion  of  the  heart  —  represented  by  one  of  the 
finite-state  processes  in  the  model  —  can  strongly  influence  the  future 
behavior  of  other  portions  of  the  heart.  The  issue  of  timing  is  concerned 
with  the  fact  that  one  can  observe  dramatic  differences  in  the  influence 
the state of one portion of the heart can have on another, depending upon
the state the other portion is in (see [8] and [10] for numerous examples).
A third extremely important aspect of these cardiac models is that the time
scale at which interactions among the discrete models change and at which
continuous waveforms are initiated is far slower than the transition-by-
transition scale at which each process evolves. It is this feature that
suggests  a  decomposition  of  the  estimator  (which  processes  the  observed 
ECG  in  order  to  track  cardiac  activity)  in  which  the  estimator  for  each 
subprocess  has  a  highly  aggregated  model  of  the  remainder  of  the  overall 
process  that  is  accurate  enough  at  the  coarse  time  scale  at  which  it  is 
important.  This  leads  to  estimation  structures  consisting  of  interconnections 
of discrete-state estimators which take as inputs the observed ECG and
estimates  from  other  local  estimators  and  which  produce  estimates  of  state 
trajectories. 

In  the  recent  past  we  have  been  developing  and  analyzing  estimators 
for  processes  that  possess  the  features  we  have  just  described.  Our  analysis 
has  been  driven  by  concerns  that  differ  from  those  which  are  usually 
considered  in  examining  estimator  performance  but  which  are  quite  natural 
for  discrete  processes  of  the  type  we  have  described  and  in  particular  for 
the ECG problem. In particular, in usual estimation problems one measures
performance by comparing the actual state and the estimate at a particular
point in time. In discrete event-oriented problems such as ECG analysis one
is more interested in the timing of events (especially those which determine
the control behavior of the heart). Thus one is concerned with errors in
time corresponding to particular values of the state or state transition.
That is, an estimate $\hat{x}$ may be considered to be quite good even if
$x(t) - \hat{x}(t)$ is often quite large, if in fact the state and estimate
trajectories have only small time shifts between them. A second important
performance measure is error recovery, a concept that is most easily stated
in coding terms. Specifically, if we think of the observations (e.g. the ECG)
as an encoding of discrete events, then we would like our decoder (estimator)
to have the property that the occurrence of inevitable decoding errors should
not lead to long strings of subsequent decoding errors.


The  other  portion  of  our  research  on  FSMP's  [12]  deals  with  the  detailed 
asymptotic  analysis  of  a  simple  FSMP  estimation  problem.  While  this  problem 
is  not  rich  enough  to  capture  all  aspects  of  the  concerns  described  in  the 
preceding  paragraph,  it  has  proved  to  be  extremely  useful  in  allowing  us 
to  begin  to  develop  quantitative,  analytic  methods  for  problems  of  this  type. 
Just as in the control work described earlier in this section, our fundamental
interest in this problem is to understand how process time scale,
observability (i.e. measurement information rates) and estimation criterion
interact.

A first problem considered in [12] is the simple two-state process
$x(t)$ depicted in Figure 2.1 where $\lambda_1$ and $\lambda_2$ are of the same order of
magnitude and where we have observations

$$dy(t) = h(x(t))\,dt + b\,dw(t) \tag{2.16}$$

where $E[dw^2(t)] = dt$. Letting

$$\pi_1(t) = \operatorname{Prob}\{x(t) = 1 \mid y(s),\ 0 \le s \le t\} \tag{2.17}$$

we can write

$$d\pi_1(t) = [-\lambda_1\pi_1(t) + \lambda_2(1-\pi_1(t))]\,dt + \frac{\pi_1(t)}{b^2}\,[h(1) - \hat{h}(\pi_1(t))]\,[dy(t) - \hat{h}(\pi_1(t))\,dt] \tag{2.18}$$

where

$$\hat{h}(\pi_1(t)) = h(1)\,\pi_1(t) + h(2)\,(1-\pi_1(t)) \tag{2.19}$$


As  discussed  in  [12]  there  are  four  natural  quantitative  measures  for  the 
performance  of  this  filter: 


(1) Filter bias. This is the distance between the equilibrium
value of $\pi_1(t)$ given that $x(t) = 1$ or $2$ respectively and
the corresponding boundary, i.e. $\pi_1 = 1$ if $x(t) = 1$ or
$\pi_1 = 0$ if $x(t) = 2$. This yields a measure of the ability
to distinguish between the two states.

(2) Variance. The variance of deviations of $\pi_1$ around its
equilibrium points excluding large deviations (i.e. false
alarms). This corresponds most closely to the usual notion
of estimation performance.

(3) Detection delays. The time it takes the filter to evolve from
one equilibrium point to a detection threshold near the other
equilibrium point following a transition in $x(t)$.

(4) Mean time between false alarms. The expected time between
crossings of the threshold corresponding to the incorrect
value of $x(t)$ given that $x(t)$ has not changed.


All of these involve examining (2.18) assuming $x(t) = 1$ or $x(t) = 2$.
If $x(t) = 1$ over the interval of interest, $\pi_1(t)$ evolves according to

$$d\pi_1(t) = [-\lambda_1\pi_1(t) + \lambda_2(1-\pi_1(t))]\,dt + K^2\,\pi_1(t)\,(1-\pi_1(t))^2\,dt + K\,\pi_1(t)\,(1-\pi_1(t))\,dw(t) \tag{2.20}$$

where

$$K^2 \triangleq \left(\frac{h(1) - h(2)}{b}\right)^2 \tag{2.21}$$

is the rate at which information is accumulated that distinguishes between
the hypotheses $x(t) = 1$ and $x(t) = 2$. If $x(t) = 2$ over the time interval,
then

$$d\pi_1(t) = [-\lambda_1\pi_1(t) + \lambda_2(1-\pi_1(t))]\,dt - K^2\,\pi_1(t)^2\,(1-\pi_1(t))\,dt + K\,\pi_1(t)\,(1-\pi_1(t))\,dw(t) \tag{2.22}$$

Note that in either case $\pi_1(t)$ is a diffusion process on the bounded domain
[0,1] and in fact the boundaries of this domain are so-called entrance
boundaries.



The evaluation of the performance measures described previously involves
the detailed study of (2.20) and (2.22). For example, the equilibria required
to determine filter bias are the stationary points of deterministic systems
obtained by setting the noise terms to zero in these equations. Note also
that it is here that we begin to see a need for asymptotic analysis: we
have two rates, one determined by $K^2$ and one by $\lambda = (\lambda_1 + \lambda_2)/2$. It should
therefore not be surprising that

$$\gamma = \frac{K^2}{\lambda} \tag{2.23}$$

(which roughly has the interpretation as the expected amount of information
collected between $x(t)$ transitions) is the critical quantity in evaluating
asymptotic approximations to the performance measures just described. In
particular, we have analyzed in detail the case where $\lambda = O(\epsilon)$, i.e. where
$\gamma$ is large ($O(1/\epsilon)$). If we think of defining detection thresholds at values
$\pi_1 = \delta$ and $\pi_1 = 1 - \delta$, we have determined that $\delta$ must be chosen very
carefully as a function of $\epsilon$ in order to obtain detection delays that are
small compared to the time between transitions and also to avoid catastrophic
streams of false alarms. In particular a choice of $\delta(\epsilon) = O(\sqrt{\epsilon})$ leads to
detection delays which go to infinity but at a much slower rate
($(-\log \delta(\epsilon))/K^2$) than the time between transitions ($O(1/\epsilon)$). Thus the
estimator is correct "most of the time". Also, with this choice of threshold
there will be $O(1)$ false alarms between $x(t)$ transitions.
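
The filter-bias computation described above (stationary points of the
noise-free versions of (2.20) and (2.22)) can be sketched numerically. The
following is an illustrative sketch only, not the asymptotic analysis of [12]:
the function names, the bisection tolerance, and the parameter values are
assumptions. It relies on the drift being positive at $\pi_1 = 0$ and negative
at $\pi_1 = 1$, so that bisection brackets the single stationary point in (0,1).

```python
import math


def drift_state1(p, lam1, lam2, k2):
    """Noise-free drift of pi_1 when x(t) = 1, from (2.20)."""
    return -lam1 * p + lam2 * (1.0 - p) + k2 * p * (1.0 - p) ** 2


def drift_state2(p, lam1, lam2, k2):
    """Noise-free drift of pi_1 when x(t) = 2, from (2.22)."""
    return -lam1 * p + lam2 * (1.0 - p) - k2 * p ** 2 * (1.0 - p)


def equilibrium(drift, lam1, lam2, k2, tol=1e-12):
    """Bisection for the stationary point of the drift on (0, 1)."""
    lo, hi = 0.0, 1.0      # drift(0) = lam2 > 0 and drift(1) = -lam1 < 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if drift(mid, lam1, lam2, k2) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def filter_bias(lam1, lam2, k2):
    """Distance from each equilibrium to its boundary (1 or 0)."""
    bias_given_1 = 1.0 - equilibrium(drift_state1, lam1, lam2, k2)
    bias_given_2 = equilibrium(drift_state2, lam1, lam2, k2)
    return bias_given_1, bias_given_2


if __name__ == "__main__":
    k2 = 1.0
    for eps in (1e-2, 1e-4, 1e-6):
        b1, b2 = filter_bias(eps, eps, k2)
        # Compare with the leading-order estimate sqrt(lam1 / K^2).
        print(eps, b1, b2, math.sqrt(eps / k2))
```

For slow transitions relative to the information rate, the computed bias is
close to $\sqrt{\lambda_1}/K$ when $x(t) = 1$, which follows from balancing
the $-\lambda_1$ and $K^2(1-\pi_1)^2$ terms near $\pi_1 = 1$.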

The problem just described serves as a first step in analyzing the
two-time-scale process $(x_1(t), x_2(t))$ illustrated in Figure 2.2 with
measurements

$$dy_1(t) = h_1(x_1(t))\,dt + b_1\,dw_1(t) \tag{2.24}$$

$$dy_2(t) = h_2(x_2(t))\,dt + b_2\,dw_2(t) \tag{2.25}$$

First note that using the analysis in [2] this process can be decomposed
into a hierarchy of two two-state processes: a slow process corresponding
to transitions in $x_1(t)$ and a fast process corresponding to $x_2(t)$. Note
that this structure is exactly of the form under investigation in [8],
[10]: an interconnection of two processes in which rare transitions
in the $x_1(t)$ process influence the transition rates ($\mu_1$ and $\mu_2$ vs. $\eta_1$ and
$\eta_2$) of the fast process.

The  structure  of  the  estimator  under  investigation  is  the  following: 


(1) The measurements $y_1(t)$ are processed using the aggregate
two-state Markov model for $x_1(t)$ evolving at the slow time scale.
Since the difference between the actual evolution of $x_1(t)$
and that predicted by the approximate model is $O(\epsilon)$, the
conclusions described previously for the two-state process
hold here as well.

(2) Given the estimate $\hat{x}_1(t)$, an estimate of $x_2(t)$ is generated
by using $y_2$ and the two-state model for $x_2(t)$ corresponding
to $\hat{x}_1(t)$. Performance here can be evaluated in a fashion similar
to that described for the process in Figure 2.1. One can
evaluate the performance when $\hat{x}_1$ is correct or in error, but
the difference is significant only if the $\mu$'s and $\eta$'s are of
different orders of magnitude.

(3) The estimator structure described by (1), (2) is not nearly
optimal, but performs well under a wide range of conditions.
The reason for this suboptimality is that there may exist
nonnegligible information in $y_2$ concerning $x_1$; whether this
difference is significant or not depends on the size of the
differences between the $\mu$'s and $\eta$'s. Note that even if
this difference is of no major consequence for estimating
$x_2$, it may be significant for estimating $x_1$, since $x_1$ changes
at a far slower time scale (and thus information can be
accumulated over a much longer time period). We are presently
completing our analysis of how the information in $y_2$ can be
incorporated into the estimation of $x_1$. The basic idea is
the following. Let $\bar{h}_2(x_1)$ be defined as the expected value
of $h_2(x_2(t))$ given that $x_1$ is the correct value and $x_2(t)$
has reached its ergodic distribution. That is

$$\bar{h}_2(1) = \frac{\mu_2}{\mu_1 + \mu_2}\,h_2(1) + \frac{\mu_1}{\mu_1 + \mu_2}\,h_2(2) \tag{2.26}$$

$$\bar{h}_2(2) = \frac{\eta_2}{\eta_1 + \eta_2}\,h_2(1) + \frac{\eta_1}{\eta_1 + \eta_2}\,h_2(2) \tag{2.27}$$

Then the measurement can be written as

$$dy_2(t) = \bar{h}_2(x_1(t))\,dt + [h_2(x_2(t)) - \bar{h}_2(x_1(t))]\,dt + b_2\,dw_2(t) \tag{2.28}$$

Intuitively, if the information rate

$$\left(\frac{\bar{h}_2(1) - \bar{h}_2(2)}{b_2}\right)^2$$

is comparable to or greater than

$$\left(\frac{h_1(1) - h_1(2)}{b_1}\right)^2$$

one would expect $y_2$ to be of value in estimating $x_1$. Furthermore, if the
conditional distribution of $x_1$ evolves at a slower time scale than the
process $x_2$, one would expect that the term in brackets in (2.28) is
negligible as far as estimation of $x_1$ is concerned (although it is
all-important as far as the estimation of $x_2$ is concerned!). In this case,
we can use the approximation

$$dy_2(t) \approx \bar{h}_2(x_1(t))\,dt + b_2\,dw_2(t) \tag{2.29}$$

for the $x_1$-estimator which then has a form analogous to (2.18) except that
it is driven by both observations.
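
The ergodic averaging idea in item (3) can be illustrated with a short
calculation. This is a sketch under stated assumptions rather than the
analysis in progress: it assumes that the first rate of each pair ($\mu_1$,
$\eta_1$) is the rate of leaving state 1 of $x_2$ and the second is the rate of
leaving state 2, and that the information rate of the averaged $y_2$ about
$x_1$ takes the same form as (2.21). All function names and numerical values
are hypothetical.

```python
def ergodic_average(rate_12, rate_21, h_state1, h_state2):
    """Mean of h over the stationary law of a two-state Markov chain.

    rate_12 is the 1 -> 2 transition rate and rate_21 the 2 -> 1 rate,
    so the stationary probability of state 1 is rate_21 / (rate_12 + rate_21).
    """
    total = rate_12 + rate_21
    return (rate_21 * h_state1 + rate_12 * h_state2) / total


def information_rate(h_a, h_b, noise_gain):
    """Squared signal separation per unit noise, in the form of (2.21)."""
    return ((h_a - h_b) / noise_gain) ** 2


def averaged_y2_information(mu, eta, h2, b2):
    """Averaged levels of h2 for x1 = 1, 2 and the resulting rate.

    mu and eta are (rate out of state 1, rate out of state 2) for the
    fast process when x1 = 1 and x1 = 2 respectively (assumed convention).
    """
    hbar_1 = ergodic_average(mu[0], mu[1], h2[0], h2[1])
    hbar_2 = ergodic_average(eta[0], eta[1], h2[0], h2[1])
    return hbar_1, hbar_2, information_rate(hbar_1, hbar_2, b2)


if __name__ == "__main__":
    hbar_1, hbar_2, rate_y2 = averaged_y2_information(
        mu=(1.0, 3.0), eta=(3.0, 1.0), h2=(0.0, 1.0), b2=0.5)
    rate_y1 = information_rate(1.0, 0.0, 2.0)   # hypothetical h1 levels and b1
    print(hbar_1, hbar_2, rate_y2, rate_y1)
```

When the two sets of fast rates coincide, the averaged levels are equal and
the averaged $y_2$ carries no information about $x_1$, consistent with the
remark that the benefit depends on the size of the differences between the
two sets of rates.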